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ABSTRACT 


In this paper we perform a novel analysis of quantum 
Gaussian channels in the context of weak measurements. 
Suppose Alice sends classical information to Bob using 
a quantum channel. Suppose Bob is allowed to use only 
weak measurements, what would be the channel capac¬ 
ity? We formulate weak measurement theory in these 
terms and discuss the above question. 


I. INTRODUCTION AND MOTIVATION 


Gaussian channels were first presented in Shannon’s 
work [1]. They appear naturally in classical communica¬ 
tion theory [2]. A Gaussian channel is a channel where 
the output Y is the sum of the input X and a noise 
Z, where the noise is drawn randomly from a Gaussian 
distribution usually with 0 mean and cr^ variance [2]. 
In this work the noise is introduced through the weak 
measurement process at the end point of the channel, 
rather than as a result of some external process. 


The capacity of quantum Gaussian channels was cal¬ 
culated in [3, 4] and was further studied in [5-7]. In this 
paper, we will rely on these works by choosing particular 
cases of the above theory, but at the same time, we will 
adopt rather different perspective and applications from 
the known ones. 

Weak measurements were first presented by Aharonov 
et al. in [8]. In a nutshell, a measurement is weak if the 
measuring apparatus, which contains a quantum element 
we call a ‘needle’, is weakly coupled to the quantum sys¬ 
tem we wish to measure, and therefore yields a set of 
eigenvalues covered with random noise. The needle is 
usually normally distributed with high variation. 


So far, weak measurements were not discussed in this 
language of quantum channel capacity. We believe that 
such a discussion can clearly draw the line between strong 
and weak measurements in quantum mechanics. By 
‘weak measurement’ we refer to a lower bound on the 
variance of the needle (see below), therefore all the above 
is true for a large set of measurements strength. 

Our main result can be stated as follows: let {xi} be 
the set of letters used by Alice. Let A be a random 
variable describing the distribution of letters Alice tries 
to send. Suppose Alice is employing a quantum chan¬ 
nel, sending a density matrix pi for each of the letters 
Xi- Next, Bob is performing a weak measurement pro¬ 
cess [9] of some Hermitian operator A at the end point of 
the channel. We assume therefore that the {xi} are the 
eigenvalues of A, this should be part of the protocol Alice 
and Bob share. Let Z denote the normal Gaussian distri¬ 
bution V(0,tT^) of Bob’s measurement needle before the 
weak coupling. Then the maximal rate of transmission 
through the channel will be: 


uraxH{X ■. X + Z), (1) 

X(A) 


where H{X ■. X + Z) is the classical mutual information 
between the two classical random variables, and the max¬ 
imum is taken over all discrete probability distributions 
X on the set {xi} of eigenvalues of A. The above formula 
resembles the one presented by Shannon for a Gaussian 
channel [1], however here it has a different meaning. 

Moreover, let 

uiaxE{X^) = Px. (2) 

X{A) 


Then the capacity of the weak channel will be bounded 
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from above by: 

Taax.H{X : X + Z) <-log (+ . (3) 

x{A) 2 V / 

This is the main result of the paper which, to our under¬ 
standing, is both novel and insightful. Note that in the 
classical case all signals passing the channel are a priori 
bounded by the power of the signal. Here, the maximal 
variance of all probability distributions on A takes the 
place of the power. This calls for a more general defini¬ 
tion of a weak channel where one can use any Hermitian 
matrix in the weak measurement process. Note also that 
the upper bound in 3 is achieved in the classical case. 
The difference between the classical and quantum bound 
is due to the definition of the channel. In classical chan¬ 
nels the Gaussian noise is presented during the passage 
of information from the transmitter to the receiver. In 
the Quantum weak Gaussian channel (QWGG) case the 
‘noise’ is presented at the last step of the communication, 
during the process of weak measurement. In particular, 
the codewords in the classical case are produced by sam¬ 
pling from a normal distribution, whereas in the quantum 
case the codewords are sampled from a finite discrete en¬ 
semble of density matrices (see also the proof of part (a) 
of the main theorem in section III) hence the difference 
in the bound. The method used in this paper combines 
arguments from discrete channel theory with arguments 
from continuous Gaussian channel theory [2]. 

Two factors therefore define the upper bound on the 
rate of information flow of QWGC; the maximal variance 
of H, and cr^, the variance of the needle. 

The rest of the paper is organized as follows: in sec¬ 
tion II we introduce some necessary weak measurements 
preliminaries and prove two useful lemmas. In section III 
we present the notion of quantum weak gaussian channels 
and prove the main result of the paper. 


II. WEAK MEASUREMENT PRELIMINARIES 


Weak measurement should be treated as a generaliza¬ 
tion of quantum strong measurement. In weak measure¬ 
ment theory both the system and the measuring needle 
are quantum systems [9-11]. Weak measurement con¬ 
sists of two steps. In the first step we weakly couple 
the quantum measurement device to the quantum sys¬ 
tem via a von Neumann interaction Hamiltonian. In the 
second step we ‘strongly’ measure the needle. The col¬ 
lapsed state of the measurement device is referred to as 
the outcome of the weak measurement process. For a 


measurement to be weak, the standard deviation of the 
measurement needle should be larger than the difference 
between the eigenvalues of A. We will now describe this 
process in details. The procedure resembles the von Neu¬ 
mann scheme for performing ‘strong’ measurements [14], 
however, here we use a very weak entanglement between 
the system and the measurement device (see also [9-11]). 

Let \4)d) denote the wave function of the measurement 
device. When represented in the position basis it will be 
written as: 


1^) = I'Ad) = [ (t){y)\y)dy, (4) 

where y is the position variable of the measuring needle. 
Let Yd be the position operator such that Yd\y) = y\y) 
(here, we use Yd to distinguish the operator Yd from its 
eigenvector \y) and eigenvalue y, the subscript d is used 
for measuring device). We will also assume that initially 
4>{y) behaves normally around 0 with some variance cr^: 


(j){y) = (27r(T^) ^ . (5) 

We will later (strongly) measure \(j)d), *-e. collapse the 
device’s needle to get a value which is the weak measure¬ 
ment’s outcome. 

Let S denote our system to be measured. Suppose A 
is an Hermitian operator on the system S. Suppose A 
has N eigenvectors Js^) such that A\xi) = Xi\xi). 

Consider the general state vector j^) expressed in the 
eigenbasis of A-. 


H)=^ai\xi)- ( 6 ) 

i 


Consider the interaction Hamiltonian ([10, 11]): 


H = Hint = git)A 0 Pd. (7) 

Here git) is a coupling impulse function satisfying: 
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[ g{t)dt = 1, (8) 

^0 

Let il = let p be a density matrix on the 

main system Q, let ]</))(</>] be the density matrix for the 
needle space of Bob, let Py = \y){y\. We can then write: 

where T is the coupling time and Pd is the operator conju¬ 
gate to Yd such that [Yd, Pd] = ih. For the measurement 
to be considered weak, the coupling strength should be 
much smaller than the standard error a of the measure¬ 
ment needle. 

p{Y = y)dy = tr{PyU{p® {\(j)){(j)\))U'^Pl)dy 

We shall start the measurement process with the vec¬ 
tor: 

The following two simple lemmas are immediate and are 
needed for the definition of h£'^{Y). 

Lemma II. 1 

1^) ® l<^(2/)), (9) 

If p is pure, p = ]'*/') (l/’l (where \ip) as in Eq. 6 above), 
then 

in the product space of the two systems. Then we apply 
the following time evolution based on the weak measure¬ 
ment Hamiltonian [10, 11]: 

1 P J/2 

P{yi <Y<y 2 ) = J2 

“ V27rcr Jyi 

® \<^{v)). (10) 

Proof: See [16] for example. ■ 

It is easy to see that on each of the vectors 0 \4>{y)) 

the Hamiltonian H takes Yd to Yd + Xi, (Heisenberg evo¬ 
lution): 

The distribution of Y is multi-normal with coefficients 
and means Xi. 

Lemma II.2 

= j "-[H,Yd]dt = x, (11) 

Under the above assumptions; if p ia a density matrix, 
then 

(see [15, section 8.4]). The corresponding transformation 
of the coordinates of the wave function is: 

1 /*y2 

Piyi <Y<y 2 ) = j dz 

(13) 

0 [(/i) = ^ ai\xi) 0 \(j){y - Xi)). (12) 

where ) projective operator \xi){xi\ and Zi ~ 

N {xi, cr^), is a normal distribution with mean Xi and vari¬ 
ance cr^. 

The above wave functions \4>{y ~ Xi)) have high variance 
and therefore they overlap each other. The higher the 
variance, the weaker the measurement process. If these 
normal wave functions do not overlap then the measure¬ 
ment is strong. Therefore we can control the measure¬ 
ment process by the choice of the variance. 

Proof: Use a simple Bayesian argument and the fact 
that tr(pP^ p is the probability that the £x ) 
tive measurement yields Xi. 1 

Notation: Denote the entropy of the above multi¬ 
normal distribution (Eq. 13) on Y by h£’'^{Y). 
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III. QUANTUM WEAK GAUSSIAN CHANNELS 


In this section we present and analyze QWGC and 
prove the main theorem of the paper. QWGCs are quan¬ 
tum channels where in the last step we use weak mea¬ 
surements instead of ‘strong’ POVM measurements. 

Suppose Alice is sending classical information to Bob 
using a quantum channel [12]. Let X denote a random 
variable such that p{X = i) = Pi- With probability pi 
Alice will pick a density matrix pi out of a given ensemble 
of density matrices. She will then send pi to Bob using a 
quantum (physical) channel. Define the density matrix: 


P = 

i 

At the output of the channel Bob is using a weak mea¬ 
surement device trying to guess the value of the random 
variable X. Bob is using an Hermitian operator that has 
the same eigenvalues as Alice’s letters. The measure¬ 
ment’s results yield a random variable Y. To compute 
the mutual information between Alice and Bob we could 
use a Holevo type of bound. However, since Bob is us¬ 
ing a particular measurement (strong projective on the 
continuous variable Y) we can compute H^’^{X : Y) 
explicitly using the definitions of section II above: 


HfP{X : F) = Hl»{Y) - 


In the following lemma we compute the maximal mu¬ 
tual information between Alice and Bob for all distribu¬ 
tions {pi}i and all densities {pi}i. In the main theorem 
that follows we will show that this maximal mutual in¬ 
formation is also the capacity of the channel. 

Lemma III.l 


ma.xH^’P{X : Y) = maxH{X : X + Z) 

Pi’Pi ^ X(A) 

where the R.H.S of the above equation is the maximal 
mutual information of two classical random variables 
X{A) and X(A) -|- Z where X{A) is any discrete ran¬ 
dom variable on {xi}, the eigenvalues of A, and Z is the 


normal random variable centered around 0 with variance 


Proof: 


a) maxH^’^iX : Y) < maxH{X : X + Z). 

Pi,Pi ^ X(A) 

Suppose we are given {pi} and {pi}. For each density ma¬ 
trix Pi, the distribution of Y is multi-normal (see lemma 
11.2 above) and therefore, by the concavity of entropy 
(see 11.3.5 of [2]): 


H{Z) = H{N{Q,a^)) < Hf^AY), (14) 

for all i. Define the following random variable X on the 
set {xi}f by p{X = Xi) = tr(/9M|^.p, then 


HfAY) - Y^P^Hf^AY) < H{X + Z)- H{Z), (15) 

where we have used the fact that H{N{Q,aA) = 
H{N{xi,aA) for all i. Gonclude now by recalling that 
H{X :X + Z) = H{X + Z)- H(Z) [2], 


b) maxH{X : X + Z) < maxHj’AX : Y). 

X{A) Pi,Pi ^ 

For each distribution X(A) on the L.H.S we find a 
density p on the R.H.S. Given a random variable X, such 
that p(X = Xi) = Pi, we define: 


Pi 

Then h'^'^'{Y) = H{N{xi,aA)- Let p = J^PiPi^ then: 


H{X :X + Z) = hJ’AX : Y), 
hence the above inequality. I 
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Main Theorem: 

measure the product by coupling n normal needles and 

The rate of transmission of classical information in the 
QWGC, that is, the channel between Alice and Bob is: 

strongly measuring all of them. By a standard classical 
argument we can separate H{X : X + Z) such tensors 
[2]. Thus: 

R = max{i/J'"(r) - ^ (T)}. (16) 

R> H{X : X + Z) 

Proof: Denote: 

Maximizing now over all a priori distributions p on A, 
we get: 

xj = 

R > maxi?(A : X + Z). 

X{A) 

We will prove that: 

By lemma III.l above: 

(a) R>Xa 

R > max{ijf ^(F) - (F)}, (17) 

ib) R<Xa 

which concludes the proof of (a). 

Proof of (b): We follow the argument in [12] with 

To prove (a) we use a reduction to classical argument. 
To prove (b) we use the subadditivity of entropy. 

some necessary adjustments. Suppose Alice and Bob are 
using one of 2"^ (n-product) codewords: 

Proof of (a): Given an a priori distribution p, Alice 
can send one of a 2”^ predetermined n-products: 

Ml = 0 ... 0p), = 

Mi= p\® ... 0 pI^ 

oni? oni? rjTiR 

M2nR = Pi 0 ... 0 = p^ 

ManR = pf 0 ... 0/9^'*, 

(this time the densities are not necessarily pure). 

Let M be the classical random variable defined by using 
each of the 2"^ codewords with equal probability. Sup¬ 

where Mi stands for the i-th codeword. Note that pi 
can be any density matrix over {|a:i)}f=iJ hence we can 
choose: 

pose Bob weakly couples a set of n needles on F = F®", 
where on each subspace he uses the procedure discussed 
in the previous section to a perform weak measurement 
with a single needle. 

II 

In what follows, we will first bound the global mutual 
information H{M : F): 

where each of the Mi codewords is e-typical according to 
the a priori distribution defined by X. Bob will weakly 

H{M-.Y)<n-xJ. 
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and then we will conclude by relating it to Bob’s error 
probability using the Fano inequality [2]. 

We begin with some definitions. In the previous section 
we defined the entropy (Eq. 13) for a general 

density matrix p on X = X{A). We now extend the def¬ 
inition to the product spaces X = (X(A))®". Consider 
the above codewords density matrix 'p on X\ 

p = ( 18 ) 

where p* is the codeword Mi as defined above. Following 
the weak coupling of measurement needles we can define 
to be the entropy of the following distribution 

on Y: 


p{{yi,o,---,yn,o) < {Yi,...,Yn) < {yip,...,yn,l)} 

Xi^ ,...,Xi^ 


Note that pj is a column in the matrix {Pj), and can be 
computed by tracing out the rest of the coordinates of p. 
By the subadditivity on Y (see [12] chapter 11.3.4): 


n 

HfP{Y)<J2Hj'^AYj) ( 22 ) 

i=i 

Subadditivity also implies that on the tensor product 
p* = Pi 0 ... 0 Pn (a row in the matrix pJ) we can write: 


n 

Hr\y)=j:HThY,). (23) 

i=i 

Combining Eqs. 19, 22 and 23 we can write: 


n 2 - . 

H{M : F) < - E (24) 



'I py^i 

_^e ^^dyi- / 
v27rcr 


(Vn 




~dyn 


where are in {l,...,d}. Now since M and Y 

behave as classical random variables we can write: 


H{M : E) = HfAY) - ^ ^ijJ>^‘(E). (19) 

i=l ^ 


To continue the proof of the theorem we shall need the 
following Lemma. 


<n max {iLj'"(r) - ^pfeiLj’"^(E)}. 

P=EPfcPfc ^ ^ A 


The last inequality is due to the fact that pj is a sum 
of density matrices on one coordinate (the j-th, see Eq. 
21 ). ■ 

We can now complete the proof of (b). Since Alice 
is sending each Mi with equal probability then the 
probability Perr that Bob is mistaken is the average: 



Lemma III.2 

H{M:Y)<n-xJ. (20) 


Proof: Consider the following density matrix on 

XAA): 


where is the probability for Bob (using weak mea¬ 
surements) to make a mistake, given that Alice had sent 
Mi (we do not have to know how to compute Perr)- Since 
R is the classical rate of transmission (Eq. 16), it is as¬ 
sumed that Perr Can be arbitrary small for large enough 
n. By the Fano inequality on the random variable M [2], 
we know that: 


Pj 


' “ 2^ Onfl Pj ■ 


i=l 


( 21 ) 


H{Perr) + Perr ' log{d^) > H{M/Y), 






6 


where we have used the fact that measuring Y yields no 
more than d” + 1 values for M. Hence, 


Moreover, let 


max E{X^) = P; 
X(A) 


( 26 ) 


nPerr ' log{d) > H{M) — H{M : Y) — iJ(Perr) 


= nR- H{M : Y) - H{Perr)- 


Since H{M : Y) < (by Eq. 24 above) we have (for 
large enough n): 


Then the capacity of the weak channel will be bounded 
from above by: 


max H{X : X P Z) < hog ( ^ + 1 V (27) 
X{A) 2 V / 


w 


p > 

err _ 


p- 
log{d) 


thus concluding that R < otherwise P^rr > 0. ■ 

We will now show a simple bound on the rate of trans¬ 
mission using the above theorem. Let the random vari¬ 
ables X on the space of eigenvalues of H, Z and Y on 
the needles space, be defined as above {Z before the 
weak measurement and Y following the weak measure¬ 
ment). Let E{X) denote the expectation value of X. 
Since E(Z) = 0 we can write: 


E{X + Zf = E{X^) + E{Z^). 


IV. DISCUSSION 


We have defined and analyzed the channel capacity of 
the QWGC. Very naturally, it turns out that the maximal 
capacity depends on the maximal second moment of the 
operator used for creating Alice’s letters divided by the 
variance of Bob’s measuring needle. 

One application of the theorem could be as follows: 
suppose Eve is eavesdropping a channel between Alice 
and Bob. To reduce the amount of interference she might 
want to use weak measurements. Suppose Eve knows 
the protocol, that is, the set of letters Alice and Bob 
are using. Eve is introducing a noise into the channel 
and Bob is trying to measure the output with a POVM 
measurement. 


The noise introduced by Eve can be described by a 

x, • 11 1 XT, x XU 1 X -u X- • • trace preserving operator SYip) on p as discussed below 

it IS well known that the normal distribution maximize o f a ^ 

the entropy of all distributions of the same variance (see 

[2] chapter 9), therefore: 


H{X PZ)< -log {2TTe{E{X‘^) + cr^)) 


where E{Z‘^) = and the log function has base 2. Also 


7Wl 





Q 

•-A J 




Y 

Py 




H{Z) = -log (27recr^) . 


Fig-1- Schematic illustration of the ‘weak channel’ 


Therefore 

Suppose the system Q is described by the density ma¬ 
trix p, and the system Y (the needle) is described by the 
pure (and continuous) Gaussian vector state \4))\ 


H{X-.X + Z)<hog(h^h^ 


(25) 
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l<^(2/)) = (27rcr^) 4 y e y \y)dy. 

First, Eve weakly couples p and |(/))(</>| by using a uni¬ 
tary operator U = U{A)-. 


c/(p® I </.) (</>!) c/t 

Next Eve acts on the product state by a set of projec¬ 
tive operators Py = |y)(y|, this corresponds to a strong 
measurement of the needle’s space which slightly alters 
the state of the measured system but with no collapse 
(see details in section II). 

Using the standard operator sum representation [12] we 
can define: 


£j{p) = j PyU{p®\cl>){cl>\)U^Py^dy (28) 
or 


(p) = J p{y)Pvdy (29) 

where 


PyU{p®\<t>)mU^P^y 

Pv = -^- hr- 

tr(P,C/(p®|0)(<^|)C/tpt) 

and 


P{y) =tr{PyUiP<£>\^){(l^\)U'‘Py)- 

Bob is now using a POVM measurement. By the HSW 
theorem, the rate of transmission in the channel between 
Alice and Bob after the weak eavesdropping is: 


{p^))}. (30) 


We can compare this capacity with the capacity of the 
same channel without eavesdropping. The reduction in 
the capacity will be the result of the information ‘leaking’ 
through the weak channel between Alice and Eve. In fact 
we can estimate this reduction using the capacity of the 
channel between Alice and Eve. 

In the above quantum weak Gaussian channels we im¬ 
pose restrictions on the measurement at the end point 
of the channel; a weak measurement using an Hermitian 
operator A. In general if Alice and Bob could use any 
Hermitian operator the capacity would be: 


max max H{X ■. X + Z) 

A x{A) 

where the first maximum is taken over all Hermitian op¬ 
erator with distinct eigenvalues. 

There is reason to believe that one could also extend 
other theorems on classical Gaussian channels to the 
quantum case, such as the ‘water filling’ theorem [2]: 
given a set of quantum weak Gaussian channels between 
Alice and Bob, each with its own capacity, what will be 
the order of usage of the channels to maximize the rate 
of information passage between them. 


We hope the above results will open the way to further 
analysis of weak measurements and better understanding 
of their pros and cons. 
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